3 research outputs found

    Uncertainty-Estimation with Normalized Logits for Out-of-Distribution Detection

    Full text link
    Out-of-distribution (OOD) detection is critical for preventing deep learning models from making incorrect predictions to ensure the safety of artificial intelligence systems. Especially in safety-critical applications such as medical diagnosis and autonomous driving, the cost of incorrect decisions is usually unbearable. However, neural networks often suffer from the overconfidence issue, making high confidence for OOD data which are never seen during training process and may be irrelevant to training data, namely in-distribution (ID) data. Determining the reliability of the prediction is still a difficult and challenging task. In this work, we propose Uncertainty-Estimation with Normalized Logits (UE-NL), a robust learning method for OOD detection, which has three main benefits. (1) Neural networks with UE-NL treat every ID sample equally by predicting the uncertainty score of input data and the uncertainty is added into softmax function to adjust the learning strength of easy and hard samples during training phase, making the model learn robustly and accurately. (2) UE-NL enforces a constant vector norm on the logits to decouple the effect of the increasing output norm from optimization process, which causes the overconfidence issue to some extent. (3) UE-NL provides a new metric, the magnitude of uncertainty score, to detect OOD data. Experiments demonstrate that UE-NL achieves top performance on common OOD benchmarks and is more robust to noisy ID data that may be misjudged as OOD data by other methods.Comment: 7 pages, 1 figure, 7 tables, preprin

    Improved Target Signal Source Tracking and Extraction Method Based on Outdoor Visible Light Communication Using a Cam-Shift Algorithm and Kalman Filter

    No full text
    An improved Cam-Shift algorithm with a Kalman filter applied to image-sensor based on outdoor visible light communication (OVLC) is presented in this paper. The proposed optimized tracking algorithm is used to track and extract the region of the target signal source Light Emitting Diode (LED) that carries modulated information for data transmission. Extracting the target signal source LED area is the premise of an image-sensor-based VLC system, especially in outdoor dynamic scenes. However, most of the existing VLC studies focus on data transmission rate, visible light positioning, etc. While the actual first step of realizing communication is usually ignored in the field of VLC, especially when the transmitter (signal source LED) or the receiver (image sensor) is moving in a more complex outdoor environment. Therefore, an improved tracking algorithm is proposed in this paper, aiming at solving the problem of extracting the region of the target signal source LED accurately in dynamic scenes with different interferences so as to promote the feasibility of VLC applications in outdoor scenes. The proposed algorithm considers color characteristics and special distribution characteristics of the moving target at the same time. The image is converted to a color probability distribution map based on the color histogram of the target and adaptively adjusts the location and size of the search window based on the results obtained from the previous frame. Meanwhile, it predicts the motion state of the target in the next frame according to the position and velocity information of the current frame to enhance accuracy and robustness of tracking. Experimental results show that the tracking error of the proposed algorithm is 0.85 cm and the computational time of processing one frame is 0.042 s. Besides, results also show that the improved algorithm can track and extract the target signal source LED area completely and accurately in an environment of many interference factors. This study confirms that the proposed algorithm can be applied to an OVLC system with many interferences to realize the actual first step of communication in an image-sensor-based VLC system, laying foundations for subsequent data transmission and other steps

    Improving Training and Inference of Face Recognition Models via Random Temperature Scaling

    No full text
    Data uncertainty is commonly observed in the images for face recognition (FR). However, deep learning algorithms often make predictions with high confidence even for uncertain or irrelevant inputs. Intuitively, FR algorithms can benefit from both the estimation of uncertainty and the detection of out-of-distribution (OOD) samples. Taking a probabilistic view of the current classification model, the temperature scalar is exactly the scale of uncertainty noise implicitly added in the softmax function. Meanwhile, the uncertainty of images in a dataset should follow a prior distribution. Based on the observation, a unified framework for uncertainty modeling and FR, Random Temperature Scaling (RTS), is proposed to learn a reliable FR algorithm. The benefits of RTS are two-fold. (1) In the training phase, it can adjust the learning strength of clean and noisy samples for stability and accuracy. (2) In the test phase, it can provide a score of confidence to detect uncertain, low-quality and even OOD samples, without training on extra labels. Extensive experiments on FR benchmarks demonstrate that the magnitude of variance in RTS, which serves as an OOD detection metric, is closely related to the uncertainty of the input image. RTS can achieve top performance on both the FR and OOD detection tasks. Moreover, the model trained with RTS can perform robustly on datasets with noise. The proposed module is light-weight and only adds negligible computation cost to the model
    corecore